- Title
- SPPlagiarise: A Tool for Generating Simulated Semantics-Preserving Plagiarism of Java Source Code
- Creator
- Cheers, Hayden; Lin, Yuqing; Smith, Shamus P.
- Relation
- 2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS). Proceedings of 2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS) (Beijing, China 18-20 October, 2019) p. 617-622
- Publisher Link
- http://dx.doi.org/10.1109/icsess47205.2019.9040853
- Publisher
- Institute of Electrical and Electronics Engineers (IEEE)
- Resource Type
- conference paper
- Date
- 2019
- Description
- Source code plagiarism is a common occurrence in undergraduate computer science education. Studies have indicated at least 50% of students plagiarize during their undergraduate career. To identity cases of source code plagiarism, many source code plagiarism detection tools have been proposed. However, conclusively determining the effectiveness these tools at identifying cases of source code plagiarism is difficult. Evaluations are typically performed using unreleased data sets. Without a comprehensive publicly available data set for source code plagiarism detection evaluation, it is difficult to perform an unbiased and reproducible evaluations of tools. To address this problem, this paper presents a tool, SPPlagiarise, which is designed to produce simulated source code plagiarism of Java source code. SPPlagiarise applies a random number of semantics-preserving source code obfuscations at random locations to a Java code base to simulate source code plagiarism. In this paper the design of the tool and an evaluation of a generated plagiarism data set is presented.
- Subject
- code obfuscation; source code plagiarism; semantics preserving clone
- Identifier
- http://hdl.handle.net/1959.13/1460374
- Identifier
- uon:45951
- Identifier
- ISBN:9781728109459
- Language
- eng
- Reviewed
- Hits: 605
- Visitors: 599
- Downloads: 0